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Abstract 

In the context of the long-standing issue of mixing in infinite ergodic the- 
ory, we introduce the idea of mixing for observables possessing an infinite- 
volume average. The idea is borrowed from statistical mechanics and appears 
to be relevant, at least for extended systems with a direct physical interpreta- 
tion. We discuss the pros and cons of a few mathematical definitions that can 
be devised, testing them on a prototypical class of infinite measure-preserving 
dynamical systems, namely, the random walks. 

Mathematics Subject Classification: 37A40, 37A25, 82B41, 60G50. 

1 Historical introduction 

The textbook definition of mixing for a transformation T : Ad — > Ad preserving a 
probability measure is 

lim /i(T-M n B) = ^l{A)^^{B) (1.1) 



for all measurable sets A, B C A4 [Wj. Extending this definition to the case where 
/i is a (T- finite measure with fi{Ai) = oo is a fundamental issue in infinite ergodic 
theory. References to this problem can be found in the literature at least as far 
back as 1937, when Hopf devoted a section of his famous Ergodentheorie [H] to an 
example of a dynamical system that he calls 'mixing'. It consists of a set C M^, 
of infinite Lebesgue measure, and a map T : Ai — )■ Ai preserving /i, the Lebesgue 
measure on A^. He proved a property that is equivalent to this one: there exists a 
sequence {pn}nm of positive numbers such that 

lim p„ /i(T-M nB)= fi{A)i^{B) (1.2) 
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for all squarable sets A,B <Z (a squarable set is a bounded set whose boundary- 
has measure zero). 

For a long time, the community did not seem to act on this suggestion, perhaps 
due in part to the impossibility, in any reasonable dynamical system, of verifying 
fll.2p for all finite-measure sets A,B. (This fact, which might not have been clear 
to Hopf himself, can be derived from a famous 1964 paper by Hajian and Kakutani 

m) 

Work in this direction, however, picked up rather intensively in the 1960 's |0r[ 
KB KOI Ell KB Ei], to the point that Krickeberg in 1967 |Kr] proposed (OD as 
the definition of mixing for almost-everywhere continuous endomorphisms of a Borel 
space (A^, /i) (with some extra, inessential, conditions on the sets A, B). Krickeberg 
applied his definition to Markov chains with an infinite state space and an infinite 
invariant measure, which is very interesting in the context of this paper because 
our main examples will be the prototypical infinite-state Markov chains, namely the 
random walks (cf. Sections 12. 2[ HlandlSj). 

Krickeberg's definition has been studied by several researchers since then jFYf 
ITolj ITo2] and, in recent times, it was independently rediscovered by Isola, who 
uses (II. 2p with A = B and calls {p„} the scaling rate [HI |I2]. It failed, however, to 
establish itself as the ultimate definition of mixing in infinite measure. In my opinion, 
this is not so much because of the less-than-perfect requirement of a topological 
structure in a measure-theoretic problem, but rather for its inherent inability to 
describe the "global" infinite-measure aspects of a dynamics: after all, (11. 2p only 
involves finite-measure sets. Related to this, it is unclear how this definition may 
be specified towards stronger and more physically relevant chaotic properties, such 
as, for example, the rate of correlation decay. A little thinking convinces one that 
the speed of convergence in (II. 2p cannot in general be uniform, even for uniformly 
nice sets {A and B can be arbitrarily far from each other so that the l.h.s. of (II. 2p 
is negligible for arbitrarily long times). 

At any rate, by the end of the 1960 's, Krengel and Sucheston |KS] approached 
the problem from a more measure-theoretic point of view and devised the following 
two definitions: A discrete-time, nonsingular dynamical system (A^,/i, T) is called 
mixing if and only if the sequence {T~"yl}„gN is semiremotely trivial for all mea- 
surable A d Ai with /i(v4) < oo; it is called completely mixing if the condition 
holds for all measurable A. (A nonsingular map is one for which ^{A) = implies 
n{T~^A) = 0. As for the definition of semiremotely trivial, which is unimportant 
here, we refer the reader to |KSj .) Both definitions reduce to the standard definition 
(II. ip for maps preserving a probability measure [Su] . 

However, Krengel and Sucheston themselves proved results that imply that many 
reasonable (including all invertible) measure-preserving maps cannot be completely 
mixing |KSt Thms. 3.1 and 5.1]. As for mixing, again specializing to measure- 
preserving maps, their definition is equivalent to 



lim /i(T-"A n 5) = 

n— >oo 



(1.3) 
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for all finite- measure sets A,B |KS1 §2]. This is a rather brutal weakening of (11 ■2p : 
for instance, it would classify a translation in M"^ as mixing! Therefore, however 
illuminating |Saj . the Krengel-Sucheston definitions are of little applicability to 
most simple extensive systems that mathematicians would like to study. 
Aaronson in 1997 jA| §2.5] wrote 

[...] the discussion in |KS] indicates that there is no reasonable general- 
isation of mixing. 

Be that as it may, the drive to produce a general definition of mixing in infinite 
ergodic theory had apparently ceased by the mid 1970's. 

In this paper I will not try to give a universal and firm definition of mixing — 
in which I am not sure I believe myself — for a-finite measure-preserving dynamical 
systems, but rather a few very general notions, that can be completely specified on 
a case-by-case basis, depending on what type of information one wants to extract 
from the dynamical system under scrutiny. 

To do so, I will borrow some ideas and a little terminology from physics, in 
particular from statistical mechanics. The key concept this work is based on is that 
of infinite-volume average, which I illustrate with an example. 

Let us consider a measurable unbounded A C and ask, what is the probability 
that a random x G belongs to A7 Clearly, the answer fully depends on what we 
mean by random. Suppose we specify that random means that each x can be drawn 
with equal probability. Then the question itself no longer makes sense because the 
Lebesgue measure m on M^, which is the only uniform measure on R^, cannot be 
normalized. 

However, remembering that long-gone course in statistical mechanics, one might 
come up with the idea that the sought probability is something like 



provided the limit exists. Of course, such an answer is riddled with issues, but it 
does capture the idea that in physics one only looks at finite quantities. Infinity is a 
mental construct to fit an endless amount of situations, and a finite limit at infinity 
is the formal way to say that most of these situations will look alike. 

More generally, for a given dynamical system {A4, , /i, {T*}), where (A^, ^) is 
a measure space and {T*} is a (semi-)group of transformations Jv[ — )■ jv[ preserving 
the infinite measure ^, we will choose a family of ever-larger sets V , with /u(^) < oo, 
that "approximate A^". Using the language of physics, one might say that choosing 
these sets will define how we measure our infinite system — more precisely, how we 
measure its observables. 

We will deal with two types of observables: The global, or macroscopic, observ- 
ables will be a suitable class of functions F G L°°{Ai, ^ , yu) for which 
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exists. What the above hmit means and what class of functions it apphes to will be 
clarified in Section |21 The local, or microscopic, observables will be essentially the 
elements of L^{M,£^, fi). 

Then, skipping many necessary details and all-important specifications which 
are found in Sections [2] and [3l our notions of mixing will basically reduce to the two 
limits: 

\im JJ{{F oT')G) =JI{F)JI{G), (1.6) 
for any two global observables F, G; and 

lim/z((i^°T*)(7)=7l(F)/i(^), (1.7) 

for F global and g local (with the obvious notation fi{g) := J gdfi). 

To the extent to which the above notions can be made into rigorous definitions — 
and they can, cf. Section [3] — they seem to improve on the attempted definitions 
that we have recalled earlier, chiefly because they involve observables which can 
be supported throughout the phase space (think, for example, of the velocity of a 
particle in an aperiodic Lorentz gas, or the potential energy of a small mass in a 
formally infinite celestial conglomerate, etc.). So they are more apt to reveal the 
large-scale aspects of a given dynamics. 

In particular, (11. 6p may be called global-global mixing, because it somehow ex- 
presses the vanishing of the correlation coefficient between two global observables, 
while (I1.7P may be called global-local mixing, because the coupling is between a 
global and a local observable. The latter notion can be quite useful if one takes 
g > with fi{g) = 1. Then fig, the probability measure defined by dfig := gdfi, 
can be considered an initial state for the system. In this interpretation, the l.h.s. 
of (II. 7p reads fJ>g{F o T*) =: T^fig{F), that is, the expected value of the observable 
F relative to the state at time t, and (II. 7p asserts that such quantity converges to 
7l(-F). Hence, Jt acts as a sort of equilibrium state for the system. 

In Section H] we will apply the above ideas to certain basic yet representative 
examples of infinite-measure dynamical systems, the random walks in Z*^. We will 
specify all the mathematical objects needed to obtain rigorous definitions out of 
(ll.6p -( !T7r|) and we will check that, under reasonable conditions, all these definitions 
are verified. The proofs, given in Section [5l use basic harmonic analysis on groups [R] 
and an estimate for a certain Fourier norm. The latter is presented in the Appendix, 
together with other technical results. 

Acknowledgments. I would like to thank an anonymous referee for pointing out 
a relevant mistake in the first draft of the manuscript. 

2 Infinite-volume limit and observables 

A measure-preserving dynamical system is the quadruple {Ai, ^ , fi, {T^}), where 
is a measure space with the cx-algebra s/, endowed with the a-finite measure n. 
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while {T*}(gG is a group (respectively, semigroup) of automorphisms (respectively, 
endomorphisms) Ai — > M., labeled by the free additive parameter t G G (without 
significant loss of generality, we assume G = N, Z, or M). This means that 

lj{T-'A) = ^{A), VAe^,VtGG. (2.1) 

If G = M, {T*} is called the flow, whereas if G = Z or N, the generator T := is 
called the map, and one usually denotes the dynamical system by {Ai,n,T). 

In this paper we are interested in the case /i(A^) = oo. The measure-theoretic 
properties of dynamical systems preserving an infinite measure are the subject of 
infinite ergodic theory whose most basic definition, perhaps, is the following 

m- 

Definition 2.1 The measure-preserving dynamical system {Ai, £/, /z, {T*}) is called 
ergodic if every measurable invariant set (i.e., any A E ssf such that fi{T~^A A A) = 
Vtj, has either zero measure or full measure (the latter meaning fi{Ai \A) = 0). 

In the case where is a probability measure, the above is one of the several equiv- 
alent formulations of ergodicity, including among others: the equivalence of the 
Birkhoff average with the phase average, for all / G L^{A4,£/,fi) (the definition 
generally ascribed to Boltzmann); the absence of nontrivial integrals of motion in 
L^{Ai, , /i); the strong law of large numbers for the random variables {/ o T*}. In 
infinite measure, these definitions are no longer equivalent and, among those that 
keep making sense. Definition 12.11 is in some sense the strongest. 

A notion that is much harder to transport to infinite ergodic theory, as we have 
discussed in the introduction, is that of mixing. In terms of observables, i.e., scalar 
functions on A^, it reads: 

V/, g G L'iM, /i), hm /i((/ o r)g) = ii{fMg). (2.2) 

i— )-oo 

In other words, the correlation coefficient between the two random variables / o T* 
and g vanishes asymptotically. This phrasing makes it apparent that the notion of 
finite-measure mixing is intrinsically probabilistic. 

2.1 Infinite- volume limit 

The discussion in the introduction suggests that one should find an asymptotic 
decorrelation formula, similar to fl2.2p . which applies to observables that, unlike 
functions, "see" a nonnegligible portion of the space. For this, one needs to define a 
sort of "normalized measure" for these observables. This is how one comes to think 
of averaging a function over M. by means of an infinite-volume limit. 

The idea is borrowed from statistical mechanics where the question arises of 
measuring the intensive quantities of a system (for example, the temperature or the 
density of a gas) . These are represented by sequences of functions defined over larger 
and larger phase spaces, corresponding to larger and larger portions of the physical 
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system, normalized in such a way that their integral converges to a finite number. 
This number is supposed to predict the outcome of an experimental measurement. 

Coming back to our math, we introduce a notation that is going to be used often 
in the remainder: 

£/f := {Ae £^ \ fi{A) < 00} . (2.3) 

Definition 2.2 The family C =0/ is called exhaustive if it contains a sequence 
{KtjneN; increasing w.r.t. inclusion, such that |J„ = -M. 

Definition 2.3 Let 'Y he exhaustive. If (p is defined on £/f and has values in some 
topological space, we write 

lim (j)(V) := lim (j)(V) = L, 

V /"M ve~v ^ ^ ' 

when, for every neighborhood U of L, there exists an M E such that 

V e y,fi{v) > M ^ (j){v) G u. 

This will be called the /i-uniform infinite-volume limit w.r.t. the family Y , or simply 
the infinite- volume limit. 

It is apparent that the above definition depends decisively on the choice of i^. In 
the example discussed in the introduction, cf. (11. 4p . f = {[— r, r]^}r>o, and it is easy 
to think of other exhaustive families of subsets of for which the infinite- volume 
limit for the "probability" of A differs from (II. 4p . 

In general, all the results that we are going to discuss in this paper will depend on 
the choice of Y in Definition 12.31 This is no shortcoming! In fact, we want to retain 
this choice, because this is how we incorporate in the mathematical description of 
an extended system the way we observe the system, that is, how we measure the 
observables of that system. In other words, the choice of Y defines what it means 
to pick a large region of A4, which we assume — or rather declare — represents the 
whole space. 

The first property we assume of our systems may be called 'compatibility of the 
infinite- volume limit with the dynamics': 

(Al) For any fixed t G G, for / M, fi{T-'VAV) = o{fi{V)). 

This means that the scale of the dynamics is local and not global: If \^ is a very 
large set "approximating A^", then its evolution at a fixed time should differ little 
from V, in relative terms. (In statistical mechanics one would say that, over a large 
region of the space, the dynamics can only produce negligible surface effects — we 
will come back to this point in Section \3A] ) 

One can expect (Al) to hold in most situations. It does, for instance, when 
is a metric space such that the //-measure of a ball grows like a power of the 
radius, uniformly in the center of the ball; the dynamics is bounded, i.e., Vt G G, 
3K = K{t) such that, for /i-a.e. x G M., dist(T*x, x) < K] and the elements of Y 
are balls. 
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2.2 A couple of examples 

A very simple example of this setup is the dynamical system defined as follows. Set 

^(x):=|f-^' fo;/^e.[0'l)' (2.4) 
(J, otherwise. 

Then, on = M, define the self-map 

Tx:=J2^i^-j)+J- (2-5) 

The definition is well-posed because, for any given a; G M, only one term of the sum 
is nonzero. Furthermore, each x has three distinct counterimages via T, where the 
derivative of the map is constantly equal to 3. This shows that T is a noninvertible 
map which preserves the Lebesgue measure fi. 

{Ai, fi, T) describes a random walk in Z, in the following sense: Suppose an initial 
condition x G [0, 1) is randomly chosen according to /io, the Lebesgue measure in 
[0, 1) (it is no loss of generality to restrict to [0, 1) because the dynamical system 
is clearly invariant for the action of Z). Then Tx will land in one of the intervals 
[—1,0), [0,1), [1,2), with probability 1/3 in each case. More generally, if [x] := 
max {m G Z \m < x} denotes the integer part of a; G M, we have 

fio {{x G [0, 1) I [T"x] = K, [T"-ia;] = [Tx] = h}) = 3"", (2.6) 

provided that \kj — kj-i\ < 1, for j = 1, . . . ,n (with ko = 0). This implies that, 
using the notation of conditional probability, 

/io ( [T"x] = I [T"-ix] = . . . , [Tx] = k,) = 

/io ( [T"x] = A;„ I [T"-ix] = = 1/3. (2.7) 

Hence, {[T"x]}„ is a Markov chain in Z with same-site and nearest-neighbor jumps, 
each with probability 1/3; namely, it is a (space-)homogeneous random walk. 

As for the choice of Y, the example of the introduction would seem to suggest 
that we pick sets of the type V = [— r, r] (with r G M) or, to fully exploit the Z- 
structure of this dynamical system with no appreciable loss of generality, sets of the 
type V = [—k,k] (with A; G N). Although this is a legitimate choice, we will see 
later that a better option is 

r:={[k,i] \k,ieZ,k <i}. (2.8) 

(Actually, this will be a crucial part of our discussion, and we refer the reader to 
Section 13.11 ) For V = [k,i], n ^ N, and i — k > 2n, it is easy to verify that 

[k + n,e-n] CT-'^V C[k-n,i + n], (2.9) 

which implies (Al). 
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A less trivial system that fits well the framework we are describing is the (ape- 
riodic) Lorentz gas [Ll \ IL2j . In (just to fix the smallest interesting dimension) a 
Lorentz gas is the billiard system in C := \ IJneN ^n, where is a countable 

collection of pairwise disjoint, convex, bounded regular sets. This means that a 
point particle moves with constant unit velocity in C until it hits an obstacle On, 
which reflects the particle according to the Fresnel law: the angle of reflection equals 
the angle of incidence (the modulus of the velocity remains equal to 1). The phace 
space for this system is then M = C x S^, where q E C represents the position and 
V E the velocity of the particle. If T* denotes the flow just described (which is 
unambiguously defined at all noncollision times), it is well known that T* preserves 
the Liouville measure /i, which turns out to be the product of the Lebesgue measure 
on C and the Haar measure on S^. Clearly, save for bizarre situations, /i(A^) = oo. 

Since sufficient conditions for the ergodicity of (A^,/i, T) are known |Llj . and 
since, for the finite- measure version of the Lorentz gas (the so-called Sinai billiard), 
mixing and stronger stochastic properties essentially follow from ergodicity [Sil IBSj 
ICM] , it is of interest to devise one or more sound definitions of mixing for this 
dynamical system. 

As for the exhaustive family Y, in analogy to the previous system, a reasonable 
choice would be 

y := {{C n R) X \R = [a,b] x [c,d], with a < 6, c < d} . (2.10) 

In Section m we will present a third example, which generalizes, in more than one 
way, the first system introduced above. It is a class of invertible dynamical systems 
representing all homogeneous random walks in Z'^. One of its points of relevance is 
that it is designed to retain the most essential features of the Lorentz gas discussed 
above. It is thus a greatly simplified toy model, which we are able to study in depth. 
As a matter of fact, it will be the testing ground for our new notions of mixing, cf. 
Sections H] and [51 

2.3 Global and local observables 

In order to define a surrogate probability measure for our system, we need to declare 
what we intend to measure. In other words, we need to specify the observables, 
namely, the functions Ai — > M which represent the (sole) information that we can 
get on the state of the system. 

In finite ergodic theory, this class of functions is L^{Ai, £/, fi), or sometimes 
L"^ {Ai , ^ , fi) . In virtually every situation, both are amply sufficient to give a full 
description of the state of the system (in fact, quite generally, the position itself 
of X G is given by a finite number of square- integrable functions). This is 
conspicuously not true in infinite ergodic theory. Indeed, the forthcoming discussion 
will try to convince the reader that the choice of the observables is precisely at the 
heart of the matter in infinite-measure mixing (a point that, after all, was already 
contained in |KS] ). 
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We deal with two categories of observables: the global, or macroscopic, observ- 
ables and the local, or microscopic, observables. 

Let us start by introducing the former, whose space we denote by Q. We will not 
give a definition, but rather a presentation of the minimal features that Q should 
have. A precise definition only makes sense on a case-by-case basis and, indeed, 
the choice of Q is part of our description of the system, just like the choice of 
As a typographical rule, we indicate a global observable with an upper-case Roman 
letter, as in F : — > R. 

We require at least the following conditions: 

(A2) g cL'^{M,s^,^). 

(A3) yPeg, 37Z(F):= lim /iy(F) := lim / F dfi. 

V^M V/M ^l{V) Jv 

We call 7i(-F) the average of F (w.r.t. /x and Y). This functional is dynamics- 
invariant: 



Lemma 2.4 Under assumptions (A1)-(A3), fi{F) = fi{F o T*), Vt G G. 



Proof. Using the invariance of /i and then (A1)-(A2), we have 

Applying (A3) gives the assertion. Q.E.D. 

As for the class of local observables, denoted by C, this can be generally taken 
to be V-{M., .9^ , /i). As we will see below, this choice is much less delicate than the 
choice of Q. Nonetheless, some results may require occasional restrictions on , so, 
in the same spirit as (A2), we only require 

(A4) £ C L^(A^,^,/i). 

Local observables are indicated with a lower-case Roman letter, as in (7 : Al — > M. 

3 Definitions and related questions 
3.1 Global-global mixing 

On the basis of Lemma [2!4t one might attempt the following definition of mixing: 



(Ml) VF, G^Q, lim 7l((F o T^)G) = 71(F) 71(G), 

t—>-oo 
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provided that JlHF o T^)G) exists for all t G G, or at least for t large enough. This 
last point represents a problem, because it is not easy, in general, to devise a space 
Q with the property that F,G & Q implies {F o T^)G G Q for all large t (sometimes 
Q is not even T*-invariant, cf. (14.91) later on). 

Generally speaking, there are only two solutions to this problem — which would 
be more honestly described as ways around it. The first solution is to declare that 
this question should be dealt with on a case-by-case basis. The second solution is 
to devise another definition of mixing which just does away with the problem: 

(M2) VF, Geg, lim /xv((F o T')G) = JliF) Jl{G), 

having adopted the notation /iy(-) = fy{-)dfi/ fiiV), as introduced in (A3). The 
above means that, Ve: > 0, 3M > such that, for all t > M and V ^ Y with 
KV) > M, 

|/iv((FoT*)G)-7l(F)7l(G)| <£. (3.1) 

Though cast in a less polished form than (Ml), (M2) still retains a great deal 
of the physical meaning of mixing because it prescribes that, if the region V is 
big enough and the time t is large enough, the two observables F o T* and G are 
practically uncorrelated on V. Actually, in some sense, (M2) is even stronger than 
(Ml), because it implies that, fixed V, (13. ip occurs uniformly in t, for t large. The 
same is not guaranteed by (Ml). See also Proposition 13.11 later on. 

We refer to (Ml) and (M2) as definitions of global-global mixing because they 
consider the coupling of two global observables. 

Let us now focus on a couple of less technical and more substantial questions 
concerning both (Ml) and (M2). The first has to do with the importance on Y 
too, not just Q, for either condition to function as a sound definition of mixing. 

Let us exemplify the question by means of the dynamical system defined by 
(I2.4p -( E3]) . This is a system that should be classified as mixing by any reasonable 
definition (cf. also Section H]). Suppose that for that system we had made the first, 
more restrictive, choice of Y presented in Section [2^2} that is, we had chosen sets of 
the type V = [-k, k], with keN. The function F : R — > R, defined by 

is bounded and has average /i(-F) = 0. Now, fix n > and consider F o T". Given 
the action of the map T and its interpretation as a random walk, it is not hard to see 
that, for X < —n, F{T'^x) = — 1 and, for x > n, F{T^x) = 1 (determining F{T'^x) 
for X G [— n, n) is more complicated and irrelevant here). This and (13. 2p imply that, 
for |x| > n, F{T^x)F{x) = 1. Therefore, for k much larger than n and V = [—k, k], 
/iv((F o T")F) is very close to 1 and indeed Jl{{F o T'^)F) = 1. Since 71(F) = 0, 
this shows that, for F as in ([22]) and G = F, both hmits in (Ml) and (M2) fail 
to hold! 
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But this is reasonable: after all, F has variations (causing it to be nonconstant) 
only on a negligible set, namely {x = 0} C A^. By negligible set we mean, in this 
context, a set whose p-neighborhoods, for all p > 0, have "measure" zero w.r.t. p. 
Therefore this is an instance of the phenomenon, which is well known in statistical 
mechanics, whereby the infinite-volume limit does not see surface effects. (When 
the dynamics is bounded, in the sense specified in the last paragraph of Section 12.11 
the evolution of F can produce no more than surface effects.) 

So we must avoid global observables that have significant variations on negligible 
sets. But this does not mean that we should cherry-pick our observables (although 
there is nothing wrong with that)! In the case at hand, for instance, the unusable 
observables are automatically eliminated by a smarter choice of "f, the one given 
in (12. 8p . That exhaustive family is translation invariant, which seems right for a 
system that is translation invariant. 

An analogous discussion can be made for the other example presented in Section 
12. 2[ the Lorentz gas, and for most extended dynamical systems one can imagine. 

We may conclude that should not be so small as to make the verification 
of (Ml) impossible nor, at the same time, so large as to make the class of global 
observables satisfying (A3) too meager. A happy medium might be for 'f to include 
all the symmetries, or "quasi-symmetries" , of the system and no more. 

The second, and more critical, question concerning (M1)-(M2) is that these 
definitions are completely blind to the local aspects of the dynamics. For instance, 
they are not able to detect an invariant set A, if p(A) < oo. One can easily produce 
a system that is mixing in one of the above senses, but not ergodic as per Definition 
12.11 (For example, take a Lorentz gas and make one scatterer hollow: points inside 
that scatterer will stay confined there, thus breaking ergodicity, while all other 
trajectories will be the same as in the unperturbed system, which one believes to be 
at least (Ml)-mixing, with the right choice of Q and "V.) 

We have no fix for this issue, other then giving a few more definitions which take 
into account local observables as well. 

3.2 Global-local mixing 

The most natural way to couple global and local observables in a definition of mixing 
is this: 

(M4) VF eg,yge £, lim p((F o r)g) = p(F)p(^), 

where we have used the convenient notation fi{g) := g (ip. This is the first notion 
of global-local mixing we give. Since for some systems, such as the Lorentz gas, this 
can be rather hard to prove |L3j, we give a weaker version as well: 



(M3) WF eg, Wg e C with p(^) = 0, lim p((F o T*)c/) = 0. 
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(Notice that (M3) and (M4) are equivalent in ordinary ergodic theory, because one 
can always subtract a constant function from any observable in order to make its 
integral vanish. Not so in infinite measure!) 

We will see momentarily that, for systems for which a uniform version of (M4) 
can be established, it is possible to pass from global-local mixing to global-local 
mixing. So we give one last definition: 

(M5) \/Feg, lim sup -i-|^((FoT*)^7)-7l(F)/i((7)| =0. 

3.3 Summary of assumptions and definitions 

For the convenience of the reader, we summarize here all the assumptions we have 
made and all the definitions of mixing we have given, listing the latter in the correct 
hierarchical order, as clarified by Propositions 13.11 and 13.21 below. 

The following are the minimal requirements on the dynamical system {A4, /i, {T' 
the exhaustive family y, the space of the global observables Q, and the space of the 
local observables £: 

(Al) For any fixed t G G, for 1/ / M, fi{T-'VAV) = o(/i(K)). 
(A2) g C L°°iM,£/,ix). 

(A3) VF e g, 3-p{F) := lim ^F) := Jim -1^^ [ F dfi. 

(A4) C<ZL\M,£^,^l). 

The definitions of global-global mixing are: 
(Ml) VF, Geg, lim Jl{{F o T')G) = Jl{F) Jl{G). 



(M2) VF, Geg, lim o T')G) = Jl{F) Jl{G). 



t— J-oo 



The definitions of global-local mixing are: 
(MS) \/F eg, \/g eC with fi{g) = 0, lim o T^)g) = 0. 



t—^oo 



(M4) VF eg,Wge C, lim /.((F o T')g) = 7l(F)^(^?). 



1 



(M5) VFg^, lim sup ——\ij{{FoT)g)-Jl{F)fi{g)\=0. 



t—^oo 



gec\o 



K\9\) 
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Proposition 3.1 Under all the assumptions made so far, 

(M5) =^ (M4) =^ (M3). 

Furthermore, (M2) implies that the limit in (Ml) holds for all pairs F,G eQ such 
that 'Jl{{F o T^)G) exists for all t large enough. 

Proof. The chain of imphcations is obvious. The last assertion follows directly 
from the definition of the double limit t — > oo, V Ai; cf. Section [XT] Q.E.D. 

With reasonable hypotheses, the strongest version of global-local mixing implies 
the "strongest" version of global-global mixing: 

Proposition 3.2 Suppose that every G E Q can be written ^-almost everywhere as 

G{x) = with Qj G £, 



and, for every V E Y , there exists a finite subset Jy of N, such that 

_ = 0{f,{V)). 

Then (M5) =^ (M2). 




(3.3) 
(3.4) 



Remark 3.3 The hypotheses of Proposition 13.21 above are less cumbersome than 
they appear. One should think of the very common situation in which Ai admits a 
partition of unity, Ylj'^ji^) = ^5 where the ipj are nonnegative integrable functions 
which are roughly translations of one another. In many such cases one can expect 
gj := Gipj to verify all of the above conditions. At any rate, if the gj are all 
nonnegative or all nonpositive, then (13. 4p follows from (13.31) . 

Proof of Proposition 13.21 Fix F,G e Q. We may assume that Ji{F) ^ 0, 
otherwise in the following argument we consider Fc := F + c, where c is a nonnuU 
constant, and easily derive the sought result at the end of the proof. 

Take e > and denote for short F* := F o T* and gv '■= Ylijejv^^' P-^P 
(A3) imply that, for niV) large enough. 



V _ 



-/1(G) 



< 



and, since F is bounded. 



V 



|/iv(G)-7l(G)|<-- 



3 171(F) I 



fxviF'G) - 



e 

< -. 

- 3 



(3.5) 



(3.6) 
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On the other hand, (M5) imphes that 

\fi{F'g,)-Jl{F)i2{g,)\ < mHh^^ (3-7) 
where hm {}{t) = and ^{t) does not depend on j or V. Summing over j G Jy and 

t—^oo 

using (13.41) . one gets 



< m < ' (3, 



for both fi{V) and t large enough. 

Putting together (13. 6p . (13. 8 p and (13. 5p . in that order, we conclude that there 
exists M = M{e) such that, for > M and t > M, 

|/xy(F*G)-7l(F)7l(G)| <£, (3.9) 

which is precisely (M2). Q.E.D. 



4 Mixing for random walks 

In this section we see how the previous definitions play out for a fairly represen- 
tative family of infinite measure-preserving dynamical systems. These are lattices 
of coupled baker's maps which generalize the random walk of Section 12.21 in two 
ways. First and foremost, they represent all the random walks in TJ^. Secondly, 
they are invertible dynamical systems, which can be reduced, for example, to the 
noninvertible dynamical system of Section by a mere restriction of the a- algebra. 

To begin with, let a random walk in if be defined by the transition probabilities 
{Pi3}i3(zzd, with p/3 > and '^pPjs = 1. This means that, if the walker is in a G 7/", 
he will have probability to move to a + /3 in the next step. We introduce some 
notation that will be useful later: 

©:={/3gZ'^ |p^>0}=:{/3(^')},ez^ (4.1) 

is the set of the "active" directions for the random walk, endowed with some enu- 
meration Zat 9 j H- G D. If © is infinite, then N := oo and Zjv := Z+; if © is 
finite, then N denotes its cardinality and Z^v := Z+ fl [1, A^]. 

We view this random walk as a dynamical system (A^, £/, /i, T), where: 

• := Z'^ X [0, If. If we denote 5„ := {a} x [0, 1)^, for a G Z^ then M = 
[j^ Sa can be interpreted as the disjoint union of Z'^ copies of the unit square. 

• ^ is the natural a-algebra for Ai, i.e., the cr-algebra generated by all the 
Lebesgue-measurable subsets of Sa, Va G Z"^, with the natural identification 

Sa ^ [0, 1)^ 
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• H is the infinite measure that coincides with the Lebesgue measure when re- 
stricted to each Sa- 

• In order to define T, set go = and, for k G Z^r, 

k 

Qk ■= Rk ■= [qk-i,qk) X [0, 1). (4.2) 

j=i 

Clearly {Rk}keZN is a partition of [0, 1)^ into adjacent rectangles of height 1 
and width, respectively, 

qk - qk~i = Ppw = /i(i?fc). (4.3) 

For X = {a,y) = {a;yi,y2) G Z'^ x [0, 1)^, let k G Zn be the unique positive 
integer such that y G Rk (equivalently, g^-i < J/i < qk)- One defines 

Tx = T (a; 7/2) := + ; {Vi - qk~i) , P/3('=)y2 + qk-i) ■ (4.4) 

Therefore T is a piecewise linear, hyperbolic, invertible map M. — )■ M. which 
preserves /i (because its determinant, in the variables (yi, ?/2), is 1). Denoting R^^k '■= 
{a} X Rki it is easy to see that T is a Markov map for the partition {Ra,k}a,k- 

Now define ip : Ai — > 7/ a,s il){a,y) = a. It is evident that, having chosen 
X = {0,y) at random in Sq w.r.t. the stochastic process {ip(T"'x)}nm is the 
random walk introduced at the beginning of the section, with initial position in the 
origin. 

Moving on, we need to specify i^, the exhaustive family of sets that determines 
the infinite- volume limit: For all 7 = (71, ... , 7^) G Z*^ and r G Z"*", the set 

B^^r = {a = (ai, . . . , Od) e Z"' | 7i - r < < 7^ + r, = 1, . . . , 4 (4.5) 

is called a square box in if- . Then we pose 

r ■={¥ = 5^,, X [0, \f I 7 G Z^ r G Z+} (4.6) 

Remark 4.1 Clearly, definition (14. 5 p does not capture all the square boxes in Z'^, 
but only those whose side length is odd. This choice, made on grounds of simplicity, 
does not really limit the generality of and indeed the forthcoming results can be 
proven even in the case when (14. 6 p is modified to include all square boxes. 



Lemma 4.2 The dynamical system and the exhaustive family defined above verify 
(Al). 
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When D is finite, the proof of Lemma 14.21 is rather straightforward, along the 
same hnes as (12.91) for the first example of Section 12. 2[ In the general case, it has 
inessential technical complications, so we postpone it to Section ETD of the Appendix. 

In order to introduce our observables, we need some preliminary notation. Let 
^ C be the a-algebra on Ai generated by the partition {Sa} and, as is customary, 
T*^^ = {T-'^A \Ae^}. Then, for i,meZ with i < m, define 

:= T^^ V T^-^^ ■ ■ ■ V T'"+^^ V T™i^. (4.7) 

To fix the ideas, =^o,i is the cr-algebra corresponding to the partition {Ra,k}- More 
generally, consider i < < m: Recalling that N = #D is the number of rectangles 
in each partition {Rk} of S^, one can see that the fundamental partition of ^i^m is 
made up of A^l^l+'" rectangles whose widths are bounded by A™ and whose heights 
are bounded by A'^', where A := max{p^}. Finally, set 

mgN 

-em 

V ^e,m. (4.8) 

-e,meN 

If we exclude, now and for the remainder of the section, the trivial case where = 1 
(i.e., Pfs = 0, V/3 7^ one has that A < 1. This and the previous observation on 

the fundamental sets of ^e^m imply that the sets of i^o,+oo are measurable unions 
of segments of the type {a} x {yi} x [0,1). We call those the local stable mani- 
folds (LSMs) of the system and ^o,+oo the stable a -algebra, also denoted by ja^. 
Analogously, £/u '■= ^-oo,o is called the unstable a -algebra and its sets are measur- 
able unions of local unstable manifolds (LUMs) {a} x [0, 1) x {y2}- Clearly, then. 

We define several classes of global observables: 

■= {F G L'^iM, =^_„^,™,/i) I 37i(F) as per definition (A3)} ; (4.9) 

G ■■= [j Gm. (4.10) 

jtiGN 

where the closure is meant in the L°° norm. One should notice that 
Lemma 4.3 Given the definitions f l4.9p - fl4.10p . 



g=lFe IJ L~(A4,^_^,„,/i) 

L mSN 
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Proof. Let us prove the left-to-right inclusion. Given F G Q and n > 1, there 
exists an F„ G {J^Gm such that — F\\ioo < 1/n. This implies that 

H^n) --< \immifiv{F) < limsup/iy(F) < 7l(F„) + -. (4.11) 

n V/'M V /"M n 

On the other hand, {7l(F„)} is a Cauchy sequence, because {Fn} is. Its convergence 
thus proves the existence of JI^F). 

Conversely, if F is an observable in the closure of [J^ L°°(A^, I^-m,m, yu) for which 
7i(F) exists, then, for any e > 0, there exist m G N and F' G L°°(A^, =^_m,m, /w) 
such that 

\\F' - F\\l^ <e/2. (4.12) 

Denoting F" := E{F\^_m,m), implies that - F"\\lo. < e/2, hence ||F - 

< Furthermore, since Y C ^ C ^-m,m, then Jt^F") exists and equals 
JI{F). This shows that F" G Gm- Therefore F G Q. Q.E.D. 

Remark 4.4 In view of Lemma [4. 3 [ one might wonder why we do not consider the 
more natural class 

Goo ■■= {F G L'^iM,J^,fi) I 371(F)} (4.13) 

instead of G- (The inclusion G C Goo is clearly strict.) The reason, which will 
hardly surprise the "hyperbolic" dynamicist, is that we need to approximate a global 
observable with locally constant functions uniformly over Ai, cf. f l4.9p -( l4rT0|) . At any 
rate, G does not lack generality: for instance, any uniformly continuous F verifying 
(A3) belongs in that class. 

As for the local observables, we also introduce countably many classes: 

Cm := L\M,^-m,m,fi)] (4.14) 

£ := L\M,£^,fi). (4.15) 

Prior to stating the main theorem of this section, we give a lemma that will help 
appreciate its statement. If {a^^^j^j C Z^, the expression span^{a^^^j^j denotes 
the subgroup of all the finite linear combinations of the a^^^ with coefficients in Z. 

Lemma 4.5 Let {/J^^'^jjgZjv C Z'^ and j' G Zjy. Then 

span^{/3(^-)-/3(^-')},-,z, 

does not depend on j' . 

Proof. Section lAl2] of the Appendix. 



Theorem 4.6 Let {A4, , fi,T) be the dynamical system described above. Set v : = 
max{2, [d/2] + 1}, where [■] the integer part of a positive number, and suppose that 



18 



Marco Lend 



(i) the probability distribution p has a finite v-th momentum: 

(a) for a given f E Zjy, span^j/J^^^ - /^^^'^IjgZat = ^'^ ■ 
Then the system is mixing in the following senses: 

(a) (M5) relative to Qm o^nd Cm, for all m G N; 

(b) (M4)-(M3) relative to G and C; 

(c) (M2) relative to G; 

(d) (Ml) relative to Gm, for all m G N, with the extra requirement that F be 
Z'^ -periodic, i.e., F{a,y) = F{0,y), Va G Z'^, \/y G [0, 1)^. 

Remark 4.7 Condition (ii) is essential as it has to do with the irreducibihty of the 



random walk Sp . In fact, assuming for simplicity that G D, if span2(D) ^ Z , 
then the random walk is reducible and the system cannot be mixing in any sense, 
as is ascertained via the global observable 



Fia v) ■= <! ^' " ^ ^^--zy-j. u iq) 

^ ' ■ '0, otherwise. ^ ' ^ 

One last observation that may be of interest is that statement (d) is far from 
optimal. (Ml) holds for a much larger class of global observables, depending espe- 
cially on the distribution {p/?}. In the formulation of Theorem 14.61 however, I was 
mainly interested in a nontrivial case in which (Ml) could be verified easily. 



5 Proof of Theorem 14.6 



Since the proof is rather lengthy, we will divide it into pieces, or stages, as follows: 

Stage 1: We prove (a) using three extra assumptions. 

Stage 2: We remove one of the extra assumptions. 

Stage 3: We remove the remaning two extra assumptions. 

Stage We prove (b)-(d). 
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5.1 Stage 1: Extra assumptions 

Let us initially assume that: 



(El) F e 



'0,+ooi 



(E2) Crn only comprises indicator functions of the type g = 1q, where Q is a 
fundamental set of i^_m,o and Q ^ 5'o; 

(E3) the random walk has zero drift, i.e., /3pj3 = 0. 



From (E2), Q is a rectangle of the type {0} x [0, 1) x /. We denote the length 
of / by 

h:=\I\= fx{Q). (5.1) 
In this setting, (M5) amounts to showing that 

hm / (FoT>rf/i= hm 1 / Fd/x = 71(F), (5.2) 

/i(l^l) n^oo h 



n— >cxD 



rpnc 



uniformly in Q, that is, with a speed of convergence that does not depend on the 
choice of /. (In the above we have used the invariance of fi and the fact that 
^{\g\) = h = fi{g) .) Achieving this will be Stage 1 of the proof. 

Q can be thought of as partitioned into {Q fl -RojIjeZjv, which are rectangles of 
width Pij(j) and height h. By construction, T acts on each such rectangle by stretch- 
ing it horizontally and shrinking it vertically by a factor p^^jf, and then mapping 
the resulting rectangle, of width 1, rigidly into Sj^u). 

Iterating this procedure n times, we obtain 

T"Q = U g,,,...,,„ := U X [0, 1) X (5.3) 

which is a disjoint union of A^" thin rectangles of width 1. Each Qjj^^,„j„ is the set 
of all the points T^x for which the trajectory of x E Q followed the itinerary S'^oi), 
>S'^Oi)+/3{i2), ■ • • , where 

:= P^''^ + P^''^ + ■■■ + P^'"^- (5.4) 
Therefore, recalling fl5.ip . the height of Qj^^...j„ is 

^il,...Jn := l^jl,...,Jnl = KQjl,-,jJ = hppUl)PpU2) ■ ■ ■ P^Un). (5.5) 
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Since F G we can write, with a slight abuse of notation, F{a;yi,y2) = Fa{yi). 
Then 



5Z / / ^o.,^.....,Ayi) dyi dy2 = 

i,...,j„£Zr^-^^h.-dn ^0 

^Pl3(h)P(3U2)-- - PpUn) faj^^...,j„, (5.6) 



Jl,...,J„G/]v 

having used (15.31) . flS.Sp and the following definition: 



•1 

fa.= I F^iy,)dy,. (5.7) 







In view of f l5.4p and fl4.ip . another way to write fl5.6p is 

/ Fdfl = h ^ P/3(i)P/3(2) ■ ■ ■ //3{i)+...+/3W. (5.^ 



5.2 Fourier analysis 

The technical backbone of the proof is Fourier analysis on Z'^, for which we proceed 
to establish the necessary notation [Kl |R]. Let a := {aa}a€Z'' ^ ^*(Z'^;C), with 
s G [1, oo]. Its Fourier transform is denoted 

a{e) = a{ei, . . . , ^rf) := ^ a„ e*"'^ = J2 ^r(arO^+...+a,e,) ^ ^5 

where z is the imaginary unit and ^ = (^i, . . . , ^d) G T'^ := {R/27rZy. If s > 2, (ISlQj) 
must be intended in the weak sense, cf. f lS.lip . The corresponding inverse transform 
is given by 

a„ = /" a{9)e-"'-Ue, (5.10) 

where d6 := {2Ti)~'^d9. The Parseval formula, in this setting, reads as follows: If 
h = {ba} G C), with 1/s + 1/s' = 1, then 

(a,6) := V a^6„ = [ ^9)b{9)d9, (5.11) 

the bar denoting complex conjugation. Another standard result that we need is the 
duality between convolution and product: If 

{a*b)a ■■= "Y apba-fs = "Y aa-i3bi3 (5.12) 

/3eZ'' /3GZ'* 
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is well-defined in the proper or weak sense, then 

{^){0) = a{e)b{e). (5.13) 

Applying these concepts to our case, we see that / := {/q} € by construction 
(because F E Q C L°°(A^)) so / is a distribution on T''. On the other hand, 
P •= {Pa} G which makes p continuous (it is actually much more than that, 
cf. Section [5^ . In particular, since p is a probability distribution on Z"^, p{0) = 1. 
Defining 

;= p * . . . * p (5.14) 
n times 

we can rearrange (15. 8p into 

]- [ Fdfi = {f,p^-^)= [ W)i^\o)de= [ W)r{0)de, (5.15) 

where we have used (]5.13p . ( 15.14^ and the fact that fa G K. Hence Stage 1 of the 
proof, cf. (15. 2p . reduces to showing that 

lim (/,p(")>=7Z(F). (5.16) 

n— s-oo 

Now recall definition (14. 5p . For r G N, let 

(r) ._ / (2r + 1)-^ if a G B^y, 



" ' I 0, otherwise, 

define a function q^'^^ : — > M. Its Fourier transform is easily computed to be 

.X Asin((r + l/2)^i) 
(r + 1/2) sm&'j 

In view of (14. 6p . let us denote Va,r '■= -Ba,r x [0, 1)^. Since F verifies (A3) and 
the infinite- volume limit is /x- uniform (cf. Definition 12. Sp . we have that 

If 1 
lim — — — - / F du = lim -j > fg = 



r— >oo 



lim (/*g('^))„ = 71(F), (5.19) 

uniformly in a. (We have used notation (15. 7p .) This, in turn, yields 

lim = lim = 7l(F) (5.20) 

uniformly in n, because p*^"^ is a probability distribution on Z'^. (In the first equality 

we have used the fact that qa^ = g^T^, by construction.) This fact implies that, for 
any sequence {r„} C N with r„ — )■ oo, 

lim (/, * = 71(F). (5.21) 
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Therefore, comparing (15.211) with fl5.16p . we see that Stage 1 is achieved once we 
have shown that there exists a diverging sequence {r„} of natural numbers such that 



hm = 0. 

n— )-oo 

For an a fortiori choice of let us set 

which gives 



A convenient estimate in view of fl5.22p is 



l(/.9"">l < 



< C 



A 



(5.22) 

(5.23) 
(5.24) 

(5.25) 



where the norms || ■ ||^ and || ■ are introduced in Section [A. 31 of the Appendix 
(cf. in particular Lemma [A. 21 and notice that || ■ Hz/i; < || • H//")- 
Therefore f l5.22p will be proved once we establish that 



< 



,C«) 



(n) 



4 = 1 



L2 



— )■ 0, as n — )■ cxD, 



(5.26) 



for a suitable choice of {r„} in definition f l5.23p . Here di := d/ dOi, acting on functions 
T'^ — > C. 



5.3 Properties of and p 

In view of the above goal we need to study some properties of the functions g^*") and 
p. 

Remark 5.1 None of the proofs in this section will use (E3). 

As a start, let us notice that g^*") is C°° by construction and p is C^, with u > 2, 
by hypothesis (i) of Theorem 14.61 

Lemma 5.2 Fix r G Z+. On gM(0) = p{0) = 1 and, for 6^0, 

|gM(e)|<l, |p(e)|<l. 



Proof. We first prove the assertions on p. Since p is a probability distribution, 
p(0) = 1 and \p{0)\ < 1, \/9. Suppose by contradiction that 36*' G T'^, 9' ^ 0, such 
that \p{e')\ = 1, that is. 



(5.27) 
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for some a G M. Since p^o) > and '^jPpij) = 1, necessarily e^^''^^^^ = e*", Vj G Z^r, 
whence, Vj,j', 

Let us define the character (i.e., the homomorphism — > C C) r]0>{a) := 
e*^ It is easy to see that rjQ/ is not the trivial character (which instead corre- 
sponds to 6*' = 0; this is a particular case of the so-called Pontryagin Duality [Rl 
Thm. 2.1.2]). On the other hand, fl5:28|l reads r/e'(/3(^) - l3^^'^) = 1 and hypothesis 
(ii) of Theorem 14.61 implies that rjei = 1, thereby creating a contradiction. 

As for the assertions on g(^), there is nothing more to prove, because satisfies 
the same properties as p, insofar as the above argument is concerned. Q.E.D. 

Notational convention. From now on, C will denote a generic universal constant. 
This means that its actual value will vary from formula to formula but will never 
depend on n, r, or 9. 

Lemma 5.3 // we think ofp as a periodic function on (as opposed to a function 
on T'^), there exists a neighborhood lA of 6 = and a positive constant C such that, 

for eeu, 

< 1 - c\9\^ = i-c {ef + --- + el) . 



Proof. As 6* 0, 

p{9) = l + tv9 + 0{\9\^), (5.29) 
where f G M'^ is the drift of the random walk, defined as 

The Lagrange remainder in fl5.29p holds because p is at least C^. Hence |p(6')p = 

1 + (9(|^p), which implies the assertion. Q.E.D. 

Lemma 5.4 Regarding g^**) as a periodic function on W^, one has that the following 
expansion, 

d / oo \ 

i=i \ i=i / 

holds uniformly on the compact subsets ofM.'^. Furthermore, as r oo, 

Ur)\<C- 



Proof. By the factorizability of q^^'\ it is sufficient to treat the case d = 1. 
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Since q^^^ is compactly supported in Z*^, q^^'\6) is an entire function of 6^ which 
we have aheady calculated in (15. 171) . Its Taylor expansion at the origin is even and 
its (even) terms are 

This gives ^o('^) = 1 and the desired estimates. Q.E.D. 

We estimate the norms in the r.h.s. of f l5.26p by splitting the corresponding 
integrals into two parts: one over which is the ball of center and radius 
j7,-(i-e)/2 Qj^(^ other over T'^ \ Here £ > is a small constant to be 

fixed later and n is a large integer. 



Lemma 5.5 There exists a k > such that, for n sufficiently large, 

max Ipl" < e"'^"'. 

Proof. By elementary Taylor approximations, Lemma |5T3] implies that there exists 
a constant k > such that, for 6 

|p(e)| < e-"l'l'. (5.32) 
For n large enough, by Lemma [5. 2 [ the continuity of p and the compactness of T*^, 

max \p(e)\ = max \p(e)\ < e"^""'^' (5.33) 

(in the last inequality we have used (I5.32p . which applies because, for n large, Bn C 
U). Q.E.D. 



5.4 End of Stage 1 

Let us begin to attack f l5.26p by estimating First of all. 



L'HT'i\t3n) 



< {2nY2e 



(5.34) 



by Lemma [5.21 applied to and Lemma [5.51 — see f l5.24p . Again, applying Lemma 
15.21 to both g(^) and p, 

^ (5.35) 



< 



Li(B„) n'^{i-e)/2 



where C does not depend on r, that is, 
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As for the remaining terms in the r.h.s. of fl5.26p . clearly 



k+l=u 



(5.36) 



Let us estimate fl5.36p on T'^\Bn- Fixing / > 1, one verifies by repeated differentiation 
that 



dip 



10 = 1 jl+...+jw=l 



[n — w] 



■r-'"idi'p)idi'p)---id'rp}, (5.37) 



where the combination of the two sums above represents the sum over all the parti- 
tions {ji, j2, • • • ,jw} of I (i.e., ju > 1 and ji + j2 + ■ ■ ■+ jw = I), with any cardinality 
w, and Cj^ j^ G N is a combinatorial coefficient independent of n. Since p G C^, 
all the derivatives that appear on the r.h.s. of f l5.37p are continuous functions of 6. 
Therefore, by Lemma 15. 5[ 



max la^p^l < Cn'e-''^"-^)'. 



(5.38) 



As for the other factors in the r.h.s. of f l5.36p . for A; > 1 we use definition fl5.17p to 
estimate 



max 



(For A; = 0, we already know that 1 — g^**) is bounded.) Using fl5.38p -( !5l39|) into 
f l5.36p . we get 



(5.40) 



L2{T<i\B,0 

which tends to zero, as n — ?■ oo, provided that r = grows no faster than a power 
of n. This will be verified a fortiori, see fl5.4ip . 

The estimation of the last term, namely ||<9j''(7*^")||l2(s„)5 is the most delicate, 
therefore we organize most of the computations involved in the following 



Lemma 5.6 For v G Z+ (not necessarily as in the statement of Theorem \4-6\ ), 
assume p G . Then take any sequence of positive numbers A„, with A„ — )■ 0. For 
n large enough and uniformly for 



one has 



max 



d^gi-) 



< (7^2i.^{-2+2£+i^{l+£))/2_ 
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Once Lemma 15.61 is established, which will happen momentarily, we can finally 
choose both the sequence {r„} and the parameter e: the former will be any diverging 
sequence such that 

rn < Cn\ (5.41) 

whereas 

£<min<^-, — JTT r • (5-42) 



3' 2(5z/ + 2 + rf/2) 

The condition e < 1/3 is necessary in order to apply Lemma [5.61 to r = r„. In fact, 
via f l5.4ip . r„n^*^^^^)/^ < CrT^^^'^'^'^l'^ =: A„. The other inequality is needed when we 
use the assertion of Lemma 15.61 to estimate 

< (7^"2+i.~d/2+(5;/+2+rf/2)£_ (5.43) 

For e so small that (5z/ + 2 + d/2)e < 1/2 the above vanishes, as n — )■ oo, because 
u — djl < 3/2 (for d = 1, it equals 3/2; for > 2 it equals 1 or 1/2, depending on 
d being even or odd). 

In conclusion, estimates fOIj) . fCTI]) and (K^ prove (K^ . thus (lE^ . 

thus f l5.16p and, lastly, ( 15. 2p . uniformly in Q as requested. This ends Stage 1 of the 
proof. 

Proof of Lemma 15.61 Throughout the proof it is understood that 6 e Bn, i.e., 
1^1 < n^^^^^^^'^. The condition on r ensures that 

\^j{r)e^^\ < Cr2%-(i-^)-'' < CAl^, (5.44) 

which implies that, for n so large that A„ < 1, the expansion of Lemma [5.41 and all 
those derived by it by differentiation w.r.t. 6i are meaningful. More importantly, as 
n — > oo and uniformly in r as described in the statement of the lemma, each such 
expansion can be approximated by a convenient upper bound on its first term. We 
indicate this with the symbol ~: for example, 

1 - g{^)(e) ~ 16(^)^1^ < Cr^n-'^^-'^ (5.45) 

and, for /c > 1, 

r^^.Mrm j {k + l)\\^ik+i)/2m\ < Cr'=+ln-(l-)/^ if A; is odd, 
^ ' ^ \ k\ \^k/2ir)\ < Cr\ if A; is even. 

(5.46) 

Furthermore, (E3) is equivalent to Vp'(0) = 0, which implies 

\dipi9)\ < C\9\ < Cn-^^-'^/\ (5.47) 
We will proceed by induction on u > 1. When u = 1 our function reads 

di^) = -di^) + (^1 - dip. (5.48) 
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Hence, from (I5.45p -( l5^47|) . and Lemma [5^ applied to p, 



max 



< Cr''n^~^+^'^'\ (5.49) 



proving the assertion for z/ = 1. 

Now we assume the assertion with v and set out to prove the one with z/ + 1. In 
practice, this means that increasing the order of the derivative by one must worsen 
the inequahty of Lemma [5.61 at most by a factor r^n'^^+^^/^. 

We apply di to (I5.36p . On the r.h.s., di can either hit 9f (1 — g(^)) or d\-p^. Let 
us analyze the two cases separately. 

In the first case, assuming for the moment > 1, we see via fl5.46p that 

r)^+i.T)rm < / = (^^'^' if k is odd, 

<i W S I ^^fc+2^-(i_e)/2 ^ (^^fc^^2^-(i-.)/2^ if is even. ^^"^^^ 

The terms within parentheses in the above represent the estimates for d^q^^\ re- 
spectively for k odd and even, coming from f l5.46p . Also, for k = 0, 

Again, the term in the parentheses is the estimate for 1 — g^*") coming from fl5.45p . 

In any event, applying another derivative to the term 9f (1 — g^**)) will change our 
estimate at most by a factor r^n'^^"'^)/^, which is consistent with our inductive step. 

In the second case, we use the expansion fl5.37p : di can either hit p^-"" or one of 
the dl^p. In this first sub-case, 

\d,p^''"{B)\ = {n-w) \p^-'"-\e) d.${e)\ < Cn\e\ < Cn(^+^)/^ (5.52) 

via f l5.47p . As for the second sub-case, without loss of generality, we assume the 
worst-case estimate for dl^p on that is, 

|af..«.)|.{^;^i^ (5,53) 

This implies that increasing by one the order of the derivative in the l.h.s. of f l5.53p 
will worsen our most conservative estimate at most by a factor n^^"^)/^. Considering 
f l5.52p as well, we conclude that applying another derivative to S'p" will change its 
estimate at most by a factor n^^+^^Z^, which is again consistent with our inductive 
step. Q.E.D. 

Remark 5.7 The careful reader might worry that the unrigorous use of the symbol 
C for a generic constant may jeopardize the above proof. It does not, since all 
the constants that have been used do not depend on r or n. In principle, they may 
depend on k (though it is easy to see that they do not), or i, or j^, but these integers 
only take on a finite number of values, so bounds can be found that do not depend 
on any of the variables. 
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5.5 Stage 2: Removing (E3) 

If, contrary to assumption (E3), v = —zVp{0) ^ 0, cf. fl5.30p . we define b^^^ G 
to be the (not necessarily unique) lattice point for which best approximates 

f e M'^. One clearly has 



c{«) 



n 



1 

- 2^' 



(5.54) 



where the subscript i denotes, as usual, the i-th component of a d-dimensional 
vector. Now, for 6 G (— vr, tt)"^, set 

^„(0) :=pr(0)e-<'5^"'-^)/'^. (5.55) 

We want to interpret generally discontinuous function T'^ — > C. On the 

other hand, vr" is a smooth function of T'^ and, by fl5.24p and Lemma lA.ll of the 
Appendix, 



A 



A 



A 



A 



(5.56) 



(having used notation (]A.9|) as well). Comparing the above with fl5.25p . in view of 
our goal fl5.26p . we see that it is sufficient to repeat all the estimations of Sections 
I5.3H5.4[ replacing p with 7r„. This is no problem, except for estimate fl5.47p — which is 
also refiected in f l5.52p and f l5.53p . (Consider Remark 15.11 and the fact that Lemmas 
15.21 and [575] cannot distinguish between p and 7f„.) 

In order to find an effective substitute for (15.470 . we write, for 6 G Bn, 

d^MO) = ^^nniO) + Uni9') ■ 6, (5.57) 

where Un{0') is the z-th row of the Hessian of 7f„ evaluated at some 9' G This 
has a finite limit, for — )■ oo, as one can easily verify by direct computation on 
f l5.55p (it is in fact, up to a minus sing, the i-th row of the covariance matrix of p). 
Therefore 

\un{0') ■ e\ < c\e\ < ^5_5g) 

On the other hand, by fl5.54p . 



An) 



n 



5. 



(n) 



Vi - 



n 



< Cn 



-1 



Thus, using fl538|) -f l539|) in flS^Tj) . 



(5.59) 



(5.60) 



which is the same bound as fl5.47p . This proves that the if'^-norm of g^"^^ can be 
estimated as in Section WM even when p is replaced by 7r„. That is, (15. 2p holds even 
when (E3) does not, which completes Stage 2. 
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5.6 Stage 3: Removing (El) and (E2) 

It is easy to realize that the convergence rate in (15.21) is not only independent of the 
choice of Q C 5*0, it is also independent of the fact that Q is contained in 5*0, as long 
as it remains an element of the countable partition associated to ^-m,o- In fact, if 
we take Q C Sa, with a 7^ 0, we can shift, via the natural action of Z"' onto M, both 
Q and F by the quantity —a. f l5.2p continues to hold with the same convergence 
rate because all the properties of F that were used in Stages 1 and 2 are translation 
invariant. 

In formula, there exists a positive vanishing sequence neN such that, if 

g = 1q and Q is a fundamental set of SS-m^-, 

H{FoT-)g)-Jl{F)fi{g)\ < hh^^^r^. (5.61) 

Since fl5.6ip depends continuously on G L^, it is immediate to extend it to (7 = 
J2jeN^j^Qj^ with Qj > 0, that is, to a generic positive function in L^(A^, .^_m,05 /^)- 
If g is such that both the positive part g~^ and the negative part g~ are nonzero, we 
apply f l5.6ip twice to g^ and g^. An easy estimate proves that the formula holds in 
this case as well. 

Therefore (M5) holds w.r.t. G := {F e L°°{M, £/s, jJ^) I 371(F)} and Cm ■= 
L^{M,^-m,o,(J'), for all m e N. 

Now, if F e Qm and g G Cm, the invariance of fi and Lemma [23] give that 

f^i{FoT-)g)--p{F)M = 
= fi{{FoT"-'^){goT-'^))-ji{FoT^)i2{goT~"'). (5.62) 

Since g o y-™ £ and F o T"^ G G (because ^o,2m C ^/s), "we apply the previous 
result and see that (a) holds with a convergence rate '^n-2m- 

5.7 Stage 4: Proof of the remaining assertions 

Statement (a) immediately implies (M4) relative to Qm and Cm (Proposition 13. ip . 
One readily extends it to Q and C, thus proving (b), by means of the following 
obvious lemma: 

Lemma 5.8 If Q' is a dense subset of Q in the L°°-norm and C is a dense subset 
of C in the L^-norm, then (M4) for Q' and C implies (M4) for Q and C. 

As concerns (c), it is easy to verify that Proposition 13.21 applies to the classes of 
global observables Qm and local observables Cm (using the family of local observables 
ga '■= Gls^)- Therefore (a) implies (M2) relative to Qm- We extend it to Q by 
means of another obvious result. 

Lemma 5.9 If Q' is a dense subset of Q in the L°°-norm, then (M2) forQ' implies 
(M2) forQ. 
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Finally, let us consider (d). By the second part of Proposition 13. it suffices to 
show that, if F, G G Qm and F is Z'^-periodic, then 7i((-F o T'^)G) exists for n large 
enough. By the same arguments as in the proof of Lemma [2. 4[ when V M., 

fivHF o T")^) = fivHF o T"-'")(G' o T-")) + o(l). (5.63) 

So we can reduce to proving the existence of the infinite-volume limit of the above 
r.h.s., for all n > 2m. Since GoT~"^ is measurable w.r.t. J^-2m.o C =24) with a slight 
abuse of notation we can define, for a G Z'^, 

ba:= [ GoT-"'dn= [ GoT-"'{a;y2)dy2. (5.64) 



An analogous definition can be made for F o T" which is measurable w.r.t. 



(Z ^s- In this case, notice that F o T" is also Z'^-periodic, so we can write 



:= [ FoT"dii= [ FoT"{y^)dyi. 

JSa Jo 



(5.65) 



Clearly, then, 

^ (FoT"-™)(G'oT-™)ci/x = a6« (5.66) 



and, for V = Uaes^,. -^"^ 

f^vHF o T"-™)(G o T-™)) = " (5.67) 

Since 7l(G) exists, by Lemma [23] 

jl(G o T-) = hm ^ 5^ 6« (5.68) 

exists and the limit is uniform in 7. Also, it is obvious that /i(FoT"'~™) = a. Hence, 
as \^ / Al, the r.h.s. of (15:671) tends to Jl{F o T'^-"')Jl{G o T"™), which is what we 
wanted to prove. 

This concludes the proof of Theorem 14.61 Q.E.D. 



A Appendix 

We collect here a few technical results which would have been distracting in the 
body of the paper. The most important of them is an estimate on a certain Fourier 
norm that is pivotal in the proof of Theorem 14.61 This is presented in Section IA.3I 
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A.l Proof of Lemma [472] 

Since T is an automorphism, it is no loss of generality to prove the assertion for 
t = —1. Also, since (Al) is invariant for the action of Z"^ on V, we may assume 
that all V & are of the form Vr := -Bo,r x [0, 1)^. Thus, the infinite-volume limit 
becomes the limit r — )■ oo. 

Let r' = r'{r) := [r^/^] ([■] is the integer part of a positive number) and 

^{r'):=^i{TS,\Vr>)= Yl PP- 

Clearly, v'(r') \ 0, as r' and r tend to infinity. This and the translation invariance 
of T imply that 

/i(TK\K+r') <MK)<^(rO, (A.2) 

whence 

^i{TVr U Vr) < /i(K+r') + ^l{TV, \ K+r') = (2r + 1)'^ + o((2r + 1)'^). (A.3) 

With a dual argument, considering that /x(K--r' \ TVr) = iJ,{T~^Vr-r' \ K) and that 
acts essentially as T (after a swapping of the coordinates yi and 1/2, the map 
T^^ becomes of the same type as T), we obtain 

fi{TVr n Vr) > fi{Vr^r') ' f^iV-^r' \ TV) = (2r + l)"^ + o((2r + l)"^). (A.4) 

Taking the difference of ([Qj) and (1^74]) yields (Al) with t = -I. Q.E.D. 



A.2 Proof of Lemma 14.51 

We must show that, if G Z^r with j' 7^ j", then 

span^{/3(^-) - = span^j/J^^') - (A.5) 
The generic element of the l.h.s. of (lA.SP is 

7 = > >.(/3^'^ - = > >./3^''^ - I y^^. 1 (A.6) 



j:n,(/3(^-)-/3(^-')) = J]n,/3(^-)-[5:n, 



where {nj}j^ji are free variables, i.e., are arbitrarily chosen integers. Upon defining 
— ^j^ which imphes nj" = — flA.6P becomes 



7 = 

3 



which is the generic element of the r.h.s. of (lA.Sp . if we consider {nj}j-^jii to be the 
free variables and nj" to depend on them. Q.E.D. 
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A. 3 Absolutely convergent Fourier series 

In this section we present a convenient estimate for the space A of functions a : 
T'^ — C with an absolutely convergent Fourier series {a/3} = a [K, §6]. This 
functional space is defined as the maximal domain of the norm 



(A. 



This norm has a couple of straightforward invariances. For 7 G Z'^, C e T"^, and 
a E A, let 



(A.9) 
(A.IO) 



{r^am := a{e + 0. 
Lemma A.l Given a G A, for all 7 G Z*^ and C G T'^, 

||aa;^||^ = ||T<^ctm = ||ct||yt- 



Proof. Trivial verification upon computation of the Fourier series of au-y and T(^a. 
Q.E.D. 

The following estimate is a modification — mostly, a simplification — of a 1984 
result by Nowak |No] . The proof, which we give for completeness, is practically 
copied from that article. 

Lemma A. 2 Let v = [d/2] + 1 be the smallest integer strictly bigger than d/2. 
There exists a constant Cd> such that 



d\\U'\\H'' 



where 



\a\\HC' 



+5: El"? 



i=l 



a{9)d9 



1/2 



(0) 



de 



Proof. Let a = (cri, (T2, . . . , a^) be a permutation of {1,2, ... ,d). Let us define 



Clearly, 



:= {(/3i,/32, ...,/?,) G Z'^ I < < ■ ■ ■ < \/3J} 

z' = {o}u|Jz^, 



(A.ll) 
(A.12) 
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although the union is not disjoint. Using the Cauchy-Schwartz inequahty, 



where we have denoted 



/3eZ<T |ai|>0 |Q!2|>|ai| l^d-l I >l"d-2l \<^d\>\<^d-l\ 

<-cY. E ■■■ E 

|oi|>0 |«2|>|ai| |ad-l|>|ad-2| 
|oi|>0 



(as in Section m C represents a generic constant). In view of flA.12p . summing the 
square root of flA.lSp over all the permutations a, we obtain 



d 



J2 < id-i)\^,j2\Yl I • (A.15) 

0ytO i=l 




whence the assertion of the lemma. Q.E.D. 
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